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ABSTRACT 

We investigate the dependence of dark matter halo clustering on halo formation time, density pro¬ 
file concentration, and subhalo occupation number, using high-resolution numerical simulations of 
a LCDM cosmology. We confirm results that halo clustering is a function of halo formation time 
at fixed mass, and that this trend depends on halo mass. For the first time, we show unequiv¬ 
ocally that halo clustering is a function of halo concentration and show that the dependence of 
halo bias on concentration, mass, and redshift can be accurately parameterized in a simple way: 
b(M,c\z) = b(M\z)b Cvit (c\M/M *). Interestingly, the scaling between bias and concentration changes 
sign with the value of M/M*: high concentration (early forming) objects cluster more strongly for 
M< M*, while low concentration (late forming) objects cluster more strongly for rare high-mass ha¬ 
los, M> M*. We show the first explicit demonstration that host dark halo clustering depends on the 
halo occupation number (of dark matter subhalos) at fixed mass, and discuss implications for halo 
model calculations of dark matter power spectra and galaxy clustering statistics. The effect of these 
halo properties on clustering is strongest for early-forming dwarf-mass halos, which are significantly 
more clustered than typical halos of their mass. Our results suggest that isolated low-mass galaxies 
(e.g. low surface-brightness dwarfs) should have more slowly-rising rotation curves than their more 
clustered counterparts, and may have consequences for the dearth of dwarf galaxies in voids. They 
also imply that self calibrating richness-selected cluster samples with their clustering properties might 
overestimate cluster masses and bias cosmological parameter estimation. 

Subject headings: cosmology: theory — dark matter — galaxies: halos — galaxies: formation - 
large-scale structure of universe — methods: numerical 


1. INTRODUCTION 

The spatial distribution of galaxies is now well es¬ 
tablished to be dependent on several of their internal 
properties: stellar mass, luminosity, color, star formation 
rate, Hubble type, and several others fe.g..lHu bblelll936t 
lDresslerlll980t iNorberg et alJl200il iZehavi et al.ll201lT 
In the current paradigm for galaxy formation, this can 
be understood as a combination of the fact that dark 
matter halos with different masses and formation histo¬ 
ries cluster differently and host different galaxy popula¬ 
tions. A full understanding of these trends is one of the 
primary goals of modern cosmology, as it is likely to pro¬ 
vide insight into the physical process that govern galaxy 
formation and aid in the use of observed galaxy cluster¬ 
ing as a probe of fundamental cosmological parameters. 

Many models of galaxy formation and methods of cal¬ 
culating galaxy clustering statistics make two related 
simplifying assumptions. The first is that the num¬ 
ber and properties of galaxies within a host dark mat¬ 
ter halo depend solely on the mass of the host, inde¬ 
pendent of halo environment or other properties of the 
dark halo. The second is that the clustering properties 
of dark matter host halos are a function only of their 
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masses and that dark matter halos are otherwise igno¬ 
rant of their larger environments. The latter assumption 
forms part of the b asis of the excurs ion-set formalism for 
galaxy clustering llBond et al 3EM1 , at le ast in its sim¬ 
plest and most common implem entation llLacev fc AUkj 
(Somerville fe Kolatt I1999H. This implementation 
assumes that halo formation is a Markov process with 
no correlations between different spatial scales, which 
then implies that future halo accretion is independent 
from past history, and that halo histories are in d epen¬ 
den t of environment (see a lso discussion in iWhitelli DDfil 
and iSheth k. Tormenll2004H . 

The first place that these assumptions are made is 
in semi-analytic models of galaxy formation. A basic 
assumption of the technique is that the properties of 
galaxies depend only on the mass and formation time 
of the host halo. In many implementations, galaxy 
clustering is calculated by filling simulated halos at 
a fixed redshift with halo formation histories that 


are calculated analytically usin 

g the excursion-set 

formalism (e.g. Kauffmann et all 

199A 

Benson et alj 

2000): (Wechsler et al. 2001: IZentner et al. 

l200§. By 


construction, any dependence of galaxy properties on 
environment in these models must come only from 
the extent to which they populate halos of different 
masses. Note that this assumption is avoided to some 
extent in many modern implementations which use 
halo merging his tories extracted directly from N-bodv 
simulations ( e.g. iSnrmggl ~et~alll200H IHellv et allB 
[Hatton et all 12003 ISnringel et al.l 120051 IKang et al] 

120051 iCroton et all 120051 iDe Lucia et all l200fit 
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iBower et al .; 1200 5). In these implementations semi- 
analytic recipes depend on mass and formation history 
as before, but correlations between the galaxy properties 
and environment may now come from the extent to 
which they populate halos of different masses and 
formation histories. If there are physical effects that 
otherwise depend on the larger scale environment these 
would not be included. 

The second place these assumptions are com¬ 
monly made is in t he standard halo model of 
galax y clustering (e.g., ISeliakl feOOOt iPea cock fe Smitbl 

200alScoccimarro et all200lHBerlind fc Weinberel200H 

Bullock. Wechsler. & Somerville! l2QQ2t iCoorav fe Shethi 

2002 1. The halo model is a framework for calculating 
clustering statistics of objects by associating them with 
dark matter halos, which have well-studied abundances 
and clustering properties. The clustering statistics of a 
galaxy population can be computed after specifying the 
clustering properties of the dark matter halos, the prob¬ 
ability distribution for the number of galaxies in a host 
halo as a function of halo mass, and the distribution of 
these galaxies within their host halos. In principle, the 
halo bias, the probability distribution for the number of 
galaxies in a halo of fixed mass, and the spatial distri¬ 
bution of galaxies within their host halos can depend on 
other properties of the halo, but the standard assumption 
is tha t they depend only on halo mass. I Abbas Rr. Shethi 
f 20051) have recently described a modification of the halo 
model th at incorporates d epend encies on local densities. 

fLenrson fc KaiiHinaim C1 9991 ) tested the assumption 
that halo properties are independent of environment us¬ 
ing numerical simulations of cosmological structure for¬ 
mation and found no dependence of halo clustering on 
formation time or on several other properties of the dark 
halos. More recent theoretical studies have also indicated 
that any trends of halo occupation on environment have 
only a relatively small net effect on large-scale cluster¬ 
ing statistics, at least at the level that can b e measured 
in relatively small computational volumes llBerlind et alJ 
120031 iZentner et al.ll20Q.5t lYoo et all 120051). However, a 
recent study bv lAvila-Reese et al.1 ( 2005 ) found environ¬ 
mental trends with halo concentration, spin, shape, and 
internal angular momentum. 

An early indication of a relationship between halo 
formation histories and halo cluster in g pro perties was 
dem onstrated by iSheth fc Tormenl 12004 see also 
1 Wechsler! l200l . Recently, IGao. Springe!, fc Whitel 
( 2005 ) showed convincingly that the clustering of low- 
mass halos is a strong function of their formation times. 
In this mass regime, early-forming halos are signifi¬ 
cantly more c l ustered than their late-forming counter¬ 
parts. lHarker et alJ (2006) provided confirmation of 
these results using statistics of marked point distribu¬ 
tions similar to those that we employ below. The trend 
they identified is strongest for low-mass halos, which have 
only recently been well resolved in numerical simulations. 

Many properties of dark matter halos correlate well 
with halo formation time, so it is natural to inves¬ 
tigate whether these trends with formation time ex¬ 
tend to other halo properties. In particular, it is nat¬ 
ural to expect trends with t he concentrations of d ark 
halo de nsity profiles, which iNa.varro. Frenk. & ~Whitc 
(Il997t) . I Wechsler et all 120021) . and iZhao et all 120031) 
have shown to correlate well with halo formation 


time. In addition, several studies have convincingly 
demonstrated that the number of satellite halos within 
a host halo of fixed mass_is a function of halo 


formation time IGao et al. 

2004 Zentner et al. 20051; 

Ivan den Bosch et aTI 20051 

Taylor ^ B 

abull 20 Q3) and 

halo concentration IZentner 

et al 20051 

. If satellite ha- 


los are to be associated with satellite galaxies in groups 
and clusters, this indicates that the probability distribu¬ 
tion for the number of galaxies per halo, known as the 
halo occupation distribution (HOD), is also a function of 
these variables and, by extension, may likely be a func¬ 
tion of halo environment. 

Given the correlations between these halo proper¬ 
ties and formation time, it is interesting to determine 
whether or not they relate to environment in a similar 
way. This does not have to be the case; if the relations 
between these halo properties and formation time are 
themselves a function of environment, their trends with 
clustering could in principle be quite different. Moreover, 
as we discuss below, concentration and halo occupation 
have much more direct consequences for the halo model 
and its application to constraints on cosmological pa¬ 
rameters, and may have a more direct impact on tests 
of galaxy clustering. We focus our study on these halo 
properties. 

In the present study we use two large, high-resolution 
dissipationless cosmological simulations to study the de¬ 
pendence of the clustering of dark matter halos on halo 
properties other than mass. We show that halo clus¬ 
tering depends on halo formation time, and present a 
clear demonstration that halo clustering is a function of 
both halo concentration and halo occupation number. 
We show how these properties change with halo mass, 
and present the first investigation into how they change 
with redshift. We present a simple fitting formula for our 
concentration-dependent clustering results that will en¬ 
able estimates of the strength of these effects for various 
applications in the context of the halo model. We discuss 
several implications of these results for outstanding issues 
in galaxy formation, including the clustering of dwarf 
galaxies and the concentrations of low surface-brightness 
galaxies, and for the estimation of cosmological parame¬ 
ters, including self-calibration of cluster masses. 

We begin with a description of our methods in § Q 
Specifically, we describe our numerical simulations in 
§ EH and the statistics that we employ in § l2~2l In 
§EH and §EH we discuss our definitions and measure¬ 
ments of halo concentration and halo formation time re¬ 
spectively. In § 0 we explore the clustering dependence 
of both halo formation time and halo concentration, and 
present a model for the relative bias of halos as a function 
of concentration. In this section we also update previous 
results on the correlations between both halo formation 
time and halo concentration and the number of satellite 
halos contained within a host halo of fixed mass. Follow¬ 
ing this, we give the first explicit demonstration that host 
halo clustering is a function of the occupation number of 
satellite halos. In §0 we discuss the implications of our 
results for the halo model. We conclude with a summary 
of our primary results and a discussion of their implica¬ 
tions for galaxy formation models and for cosmological 
constraints derived from galaxy clustering in § 0 

2. METHODS 
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2.1. Numerical Simulations 

We investigate the environmental dependence of halo 
concentrations and halo occupation using cosmological 
./V-body simulations of structure formation in the concor¬ 
dance, flat ACDM cosmology with Dm = 1 — Da = 0.3, 
h = 0.7, and as = 0.9. The simulations were per¬ 
formed wit h the Ada ptive Refin ement Tree (ART) N- 
body code llKravtsov et ablll 9971) . The two simulations 
follow the evolution of 512 3 particles in computational 
boxes of size 120/i _1 Mpc and 80h -1 Mpc on a side re¬ 
spectively. We refer to these two simulations as “L120” 
and “L80.” The corresponding particle masses in these 
simulations are m v ~ 1.07 x 10 9 in L120 and 

m p ~ 3.16 x 10 8 r‘M 0 in L80. Both simulations use 
root computational grids of 512 3 cells and adaptively re¬ 
fine the grids according to the evolving local density fields 
to a maximum of 8 levels. This results in peak spatial 
resolutions of h pea k — 1.8h -1 kpc and h pea k ~ 1.2/i _1 kpc 
in comoving units for L120 and L80 respectively. 

We identify halos and subhalos (self-bound halos with 
centers located within the virial radius of a larger halo) 
using a varia nt of the Bo und Density Maxima algo¬ 
rithm 1BDM. iKlvnin et a1.11199(1) . Each halo is associ¬ 
ated with a density peak, identified using the density 
field smoothed with a 24-particle SPH kernel. All par¬ 
ticles within a search radius of rf = 25h _1 kpc, set to 
match the size of the smallest objects we aim to iden¬ 
tify, are removed from further consideration as poten¬ 
tial halo centers. The BDM algorithm iteratively re¬ 
moves unbound particles from each halo and uses the 
remaining bound particles to calculate halo properties 
such as the virial mass M v - Ir , circular velocity profile 
V c (r) = y/GM (< r)/r, maximum circular velocity V max , 
and the mass within a tidal truncation radius. A more 
detailed description o f the algorithm and spe cific param¬ 
eters used is given in lKravtsov et all (2004a). 

We define a virial radius R v as the radius of the 
sphere, centered on the density peak, within which the 
mean density is A v i r (.z) times the mean density of the 
universe, pu- The virial overdensity A v i r (z), is given 
by the spherical top-hat collapse appr oximation and we 
compu te it using the fitting function of lBrvan fc Normanl 
( 4998 ). In the ACDM cosmology that we adopt for our 
simulations, A v j r (z = 0) ~ 337 and A v j r (z) —> 178 at 
z > 1. In what follows, we use virial mass to characterize 
the masses of distinct host halos (i.e., halos whose cen¬ 
ters do not lie within the virial radius of a larger system). 
We quantify the sizes of subhalos using their maximum 
circular velocities, V max , because V max is measured more 
robustly in dense environments and, unlike mass, is not 
subject to the ambiguity of a particular definition. 

2.2. Correlation Statistics 

We quantify the dependence of clustering on halo prop¬ 
erties using the statistics of marked point distributions. 
For each halo property, or mark, quantified by some value 
m, the distribution of m over all halos may be character¬ 
ized by the standard one-point statistics, the mean (to), 
the variance V(m), and higher order moments. To quan¬ 
tify the dependence of clustering upon m one can con¬ 
struct the mark-correlation functi on (MCF) fcmm(r) = 
(77iiw ?)n(' r)/(m) 2 , with mark m llBeisbart fc Kersched 
120001 see iGottlober et all [2002 for another definition; 


IShethl (20051) . The notation indicates that (TOiTO 2 ) p (r) 
is the average of the product of toi and m 2 for halos at 
points x\ and X2 = X\ + r in pairs separated by distance 
r = |r|. Similarly, one can compute the average value of 
m, (m) p (r), on the condition that a halo is part of a pair 
at separation r. This formalism can easily be extended 
to discrete marks , like Hubbl e type or number of satellite 
halos (e.g.. lBeisbart fc Kerschea2OOO0 . 

Values of k mrn (r) > 1 indicate preferred clustering of 
halos with m higher than average. The magnitude of 
deviations of k mm (r ) from 1 is set by the size of fluctua¬ 
tions of m. In what follows, we employ a modified MCF 
that is normalized to the intrinsic one-point fluctuations 
in m, namely 

M m {r) = ((miTO 2 ) p (r) - (to) 2 )/V(to). (1) 

In the absence of spatial segregation on to, (mini'i) p (r) = 
(to) 2 and _A4 m (r) = 0. The deviations from the case of 
no segregation are expressed in units of V(to). Roughly 
speaking, a value of A4 m (r) = 0.25 is indicative that 
halos in a pair separated by distance r have values of to 
t hat are 0.5cr (= \Z0.25 c r) high e r than (to). 

iBeisbart fc Kersche^ l^OOgLJGqfrtdqbe^^jR 1 2002 ). 
and more recently. ISheth. Connolly, fc Skibba ( 2005), 
have used MCFs to study luminosity- and morphology- 
dependent clustering in observational , simu l ated , and 
semi-a nalyti c samples, r e specti vely. ISheth fc Tormenl 
(120041) and lHarker et all (120001) used the statistics of 
marked point distributions to study the dependence of 
halo clustering on mass assembly history in cosmological 
simulations. In this paper, we apply mark-correlation 
functions to study the environmental dependence of halo 
concentrations and subhalo abundance. 

2.3. Halo Concentrations 

The spherically-averaged density profiles of cosmo¬ 
l ogical halos can be d escrib ed by the profile of 
(Navarro. Frenk. fc White! 1 1997 . hereafter NFW) 

P(r) = Po(r-/r s ) _1 (l + r/r s )“ 2 . (2) 

The transition radius between the inner and outer power 
laws r s , is often quantified by the concentration param¬ 
eter c v i r = Rvir/cg. In the next section, we quantify 
the dependence of halo clustering on c v i r . We assign 
each halo a best-fit concentration by fitting halo den¬ 
sity pro files in lo gar ithm ically-spaced radial bins follow¬ 
ing iBullock et all 120011 B01 hereafter). We consider 
only halos with more than 250 particles within their 
virial radii, resulting in lower mass limits of M m , n — 
2.7x 10 11 hr 1 M 0 for L120 andM min ~ 7.9xl0 10 hfi* M 0 
for L80. In addition to the L120 and L80 simulations, we 
re-analyzed the simulation of B01 for the mean c v ir-Af v i r 
relation. The B01 simulation had the same formal spa¬ 
tial and mass resolution as L120 in a computational box 
of 60h -1 Mpc on a side. The cosmology adopted in B01 
was a ACDM cosmology with a power spectrum normal¬ 
ization of as = 1.0. Hereafter, we refer to this simulation 
as “L60.” We have reanalyzed the L60 simulation and 
reproduced the c v i r (Af v i r ) results of B01 using our tech¬ 
niques. 

The mean relation for c v ; r (AV v ; r ) in the L80 and L120 
simulations is well described by the model of B01 with 
modified model parameters. The scatter in c v ; r at fixed 
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Fig. 1.— Clustering and halo properties at z = 0. The top panels 
show MCFs .M ac (r) (solid lines), with normalized formation time 
a c , as mark for two different mass cuts, > 10 11,5 /i —1 Mq 

(> 300 particles) and M v i r > 10 12 /i - 1 M 0 (> 940 particles). The 
shaded bands represent the 95 th percentile of Aia c (r) formed from 
200 random reassignments of the marks to halos in the sample. 
The bottom panels show MCFs Mc wiT {r) with normalized concen¬ 
tration c vir as the mark at the same host halo mass thresholds. 
The lines and the shaded band have the same significance as the 
top panels. 


halo mass is well described for both simulations by a 
log-normal distribution with a standard deviation of 
cr(logCvir) — 0.14 at all probed ma sses in accordance 
with B01 and iWechsler et Till (120021) . In the notation 
of B01, we find that our c v i r (M v - lr ) relation is well de¬ 
scribed by the B01 model with F = 10~ 3 and K = 2.9. 
The revised parameters result in a shallower scaling of 
concentration with mass, and slightly lower values of the 
concentration around M*. It also predicts concentration 
values that are about 20% lower at 10 11 /i _1 M 0 , but 
note that this is still an extrapolation of the model to 
lower mass scales than it has been measured robustly by 
our simulations. A similar revisi on of the B01 parame¬ 
ters was previously pro posed by iDolag et all (I2004L see 
also iKuhlen et al.ll2005l who advocated lower K) to ac¬ 
commodate their simulations of cluster-sized halos and is 
in broad agreement w ith the profiles of observed clusters 
(IVikhlinin etalllSoOfill . The differences in the c v i r (A/ v i r ) 
relation are attributable to a combination of the differ¬ 
ences in the initial power spectra of the simulations and 
cosmic variance due to the finite sizes of the computa¬ 
tional volumes. 

2.4. Halo Formation Times 

For each halo in our sample, we have determined a 
mass accretion history M w \ T {a), by identifying the most 
massive progenitor of each halo as a function of time us- 
ing an algorithm similar to that of E^chsler_et_alJ (12QQ2I> 
more details are given in IAllgoo71l2flO- r il'l . TvWchsler et all 
(2002) found that the halo mass accretion histories can 
be characterized by a one-parameter family of trajecto- 



r [h _ 1 Mpc] r [h _ 1 Mpc] 


Fig. 2.— The dependence of clustering on c v i r as a function 
of redshift. The panels show MCFs A4 Cvir (r) (solid lines ) with 
normalized halo concentration c v i r , as mark for halos with M v i r > 
1011.50 a t four different redshifts. Shaded bands represent 

the 95 th percentile of A4 Cvir (r) from 200 random reassignments of 
the marks. 


ries of the form 


M (a) = Af 0 exp 



(3) 


where a 0 and M 0 are the scale factor and mass at the time 
the halo is observed. Equation © defines a formation 
scale factor a c . We assign to each halo a value of a c 
according to the value that best fits i ts mass accretion 
history, following lWechsler et al.i (j200211) . In the following 
section, we address halo clustering as a function of a c . 

The formation time a c has a number of advantages 
over other definitions of halo formation times, such as 
the times when halos first acquire fixed fractions of their 
final mass. The quantity a c is less sensitive to individual 
events in the formation of a halo, as it is based on the 
entire mass accretion history o f each halo, r athe r then a 
single epoch. As shown bv IWechsler et all ( 2002 1. it also 
has the property that its distribution and average value 
are only a function of m ass, and not redshift. 

IWechsler et all (J2002) showed that the formation time 
a c is tightly correlated with c v i r : the mean relation given 
by Cvir = ci/a C) where ci is the concentration of ha¬ 
los forming today. In the ACDM cosmology adopted 
in our simulation, these correspond to halos w ith M ~ 
10 15 ft. _1 M 0 , and c\ ~ 4 (see IWechsler et all 120021 for 
details). At fixed redshift, a c (M v ir) is a weak func¬ 
tion of M v ; r and the distribution of a c at fixed mass 
can be characterized by a lo g-normal distribution with 
er(loga c ) = fj(log c v ir) — 0.14 (IWechsler et ~ l200l . 


3. RESULTS 

3.1. Formation Time and Concentration Marks 

Figure ^ shows mark-correlation functions with halo 
formation time and concentration used as marks. The 
simple mass dependence of a c and c v i r , and the simple 
distributions of these quantities at fixed mass allow us to 
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scale out the gross mass dependence of these quantities in 
studying formation time- and concentration-dependent 
clustering. We accomplish this by assigning each halo a 
normalized formation time a c = a c /(a c (AI v i r )) and con¬ 
centration Cvir = c v ir/(c v ir(A/vir))j where (a c (M v i r )) and 
(c v ir(Af v i r )) are the averages of formation time and con¬ 
centration as a function M v j r computed in bins of width 
Alog(M vir ) = 0.20. 

Consider first the top panels of Figure Cl where we 
show MCFs A ■ia c (r) with mark a c , for distinct halos 
above two mass thresholds, M v ; r > 10 11 ' 5 /i _ 1 Mq and 
A/ V ir > 10 12 h 1 ]VIq. We represent the statistical signifi¬ 
cance of deviations of the MCF from the null hypotheses 
of the absence of spatial segregation on formation time or 
concentration by randomly reassigning the marks among 
the halos in the sample 200 times and recomputing the 
MCFs on these random samples. The shaded regions in 
Figured and all MCF figures that follow show the enve¬ 
lope formed by 95% of these randomized MCFs. 

Figure d shows a statistically-significant tendency for 
early-forming (low a c ) halos to be more strongly clus¬ 
tered in both mass bins. The strength of the trend dimin¬ 
ishes with increasin g mass. These res ults a re in qualita¬ 
tive a greement with lGao et all 120051) and lHarker et all 
(2006), who define formation time as the time the halo 
first acquired half of its final mass. We find a similar 
signal for this definition of formation time as well. This 
figure also clearly shows a dependence with scale, with 
stronger trends at a few /i _1 Mpc than on larger scales 
(note that the spread in the range for randomized sam¬ 
ples indicates larger errors at small radii due to fewer 
pairs of halos in these bins). In what follows, we explore 
the redshift dependence of clustering as a function of 
concentration c v i r , because robust determinations of a c 
become increasingly difficult at high redshift as smaller 
portions of the halo mass accretion histories are sampled. 

The known relationship between formation time and 
Cvir suggests that halo clustering should be a strong 
function of c v i r unless the relationship between c v i r and 
a c i s its elf a strong funct ion of environment. Nei¬ 
ther iLemson fc Kanj ffmarm^ looki ng for trends 

with density, nor iShethfcTbrmenl {2004) , using mark- 
correlation statistics, were able to detect any significant 
clustering segregation with concentration. However, the 
simulations employed in these studies were not partic¬ 
ularly well suited to resolve the detailed density struc¬ 
tures of halos, especially at low mass where the trends 
are strongest. Our simulations cover a similar computa¬ 
tional volume, but they have substantially higher mass 
and force resolutions compared with these earlier studies. 

Figure U clearly shows a tendency for preferential clus¬ 
tering of halos selected by their concentrations. In the 
bottom panels, we show A4 Cvir (r) at z = 0 for host ha¬ 
los above two mass thresholds. As might have been ex¬ 
pected based on the a c -dependent clustering, halos with 
high concentrations are more strongly clustered than av¬ 
erage. As was the case for formation times, the strength 
of the c v i r -dependent clustering is striking at the low¬ 
est masses. Halos with Af v ; r > 10 115 h~ 1 M Q in pairs 
separated by ^ 3 Ii^ 1 Mpc tend to have values of c v i r 
more than ~ ler above the mean relation. The statisti¬ 
cal significance of this preferential clustering persists to 
separations r > 10 /i _1 Mpc, where halos in pairs have 


Cvir values JO 0.5cr above the mean. Just as with a c , the 
Cvir-dependent clustering is a decreasing function of halo 
mass over this range. The figure indicates that in our 
simulation, the dependence of clustering on concentra¬ 
tion is even stronger than the dependence of clustering 
on halo formation time. We have verified this by re¬ 
making the bottom half of Figure El with the variable 
c QC = C\/a c as the mark. This plot looks quite similar 
to the inverse of the formation time mark, and does not 
show as strong of a signal as the measured concentration. 
It seems likely that this discrepancy is just due to larger 
measurement errors in formation time, but a more de¬ 
tailed analysis with a larger simulation will be necessary 
to determine this. 

We explore c v i r -dependent clustering at fixed virial 
mass (M v ir > lO n - 5 M 0 ) as a function of redshift in Fig¬ 
ure 0 Interestingly, this effect is a strong function of 
redshift. Indeed the sense of the clustering trend reverses 
over the redshift range shown here. Above this fixed ab¬ 
solute mass threshold, high-concentration halos are more 
strongly clustered at z = 0, while at z ~ 1 there is, at 
most, a weak trend and at z^ 2 low-concentration halos 
tend to be clustered more strongly. This tendency for 
halos with low c v i r values to be more weakly clustered 
at high redshift increases steadily with redshift there¬ 
after. This trend suggests that late-forming (high-a c ) 
halos are actually more strongly clustered at high red¬ 
shift though we show no direct statistically-significant 
evidence of this, largely due to the difficulty in making 
robust determinations of a c at high redshift. 

3.2. Concentration-Dependent Halo Bias 

Due to resolution requirements for measuring c v i r , we 
sample halos above a fixed mass at each redshift in or¬ 
der to compute the MCFs in Figure [3 but the typ¬ 
ical collapsing mass A/*, is a declining function of z. 
At z = 0, A/* ~ 8.4 x 10 12 /i _1 M 0 , while by z = 2, 
M* ~ 1.9 x 10 10 hr 1 M 0 , so objects at fixed mass be¬ 
come increasingly rare with increasing z. It is natural 
to suspect that the mass and redshift dependence found 
in Figures ^and [3 may have a common origin, due to 
Cvir- or a c -dependent clustering that is a function of the 
relative rarity of the peaks from wh ich these halos form 
in the primordial density field ('e.g.. lMo Rr, WhitfU l 996). 
To test this hypothesis, we explore the relative clustering 
of halos as a function of concentration and scaled mass 
M vir /M*. 

We are unable to test this mass scaling over a large 
dynamic range at a single redshift due to the limited 
dynamic range of our simulations. In order to explore 
this Mvi r /A/* scaling we use the L80 simulation at z = 0 
to explore the low-M v i r /M* regime and we use several 
timesteps from the L120 simulation to probe higher val¬ 
ues of Mvir/M*. To quantify c v i r -dependent clustering, 
we define a relative bias for a subsample of halos com¬ 
pared to all halos in the same mass range, 

frLrM™) = Csubsample(r|TO)/Call(r|fh), (4) 

where we have defined a scaled mass variable m = 
M v i r /M*. For each subsample, b Cvit (r) is consistent with 
being constant over 5 < rj h -1 Mpc < 10, so we reduce 
this function to one number: 6 2 vir taken over the range of 
halo separations r = 5—10 /) _1 Mpc. In Figured we com- 
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Fig. 3. — Relative bias squared for halo samples selected by quar- 
tiles in c v ir and thresholds in the mass variable m, compared to the 
bias of all halos above the same mass threshold. Each set of curves 
shows the mean bias for the indicated c v i r quartile. The shaded 
bands represent the 68% region constructed from 200 random sub¬ 
samples of the unbiased population with the same size as the biased 
subsample. The leftmost segments are taken from the z = 0 out¬ 
put of the L80 simulation and are labeled by “80 h~ 1 Mpc”. The 
remaining segments are taken from different redshift outputs of the 
L120 simulation (labeled “120 /i -1 Mpc”) as indicated in order to 
fill in the entire range of M v i r /M*. The left edge of each segment 
is determined by a minimum of 250 particles in a halo, while the 
right edge is limited by requiring that there be more than 1500 
halos in each subsample. 


pare this relative bias as a function of scaled halo mass 
ir (m) for several subsamples selected on percentiles of 
c v j r . The shaded bands account for the measurement 
error in bf, vu due to sub-sampling the full halo distribu¬ 
tion by recomputing £ a ii(r) from 200 random subsam¬ 
ples of the total halo population with the same number 
of objects as contained in each subsample. The bands 
represent the contours containing 68% of the b 2 Cvir values 
computed in this manner. Different ranges in M v i r /M* 
are covered by simulation outputs at different redshifts 
as labeled in the figure. Scaling the results by M* delin¬ 
eates a well-defined trend in this concentration bias as a 
function of to. In each redshift range, we are limited at 
the low-mass end by resolution; we require that a halo 
have at least 250 particles within its virial radius in order 
to be considered. At the high-mass end, the bands are 
limited by the requirement that there be at least 1500 
halos in each subsample. These requirements give rise 
to the finite length of each segment in Figure 0 Results 
from the L80 simulation at z ~ 1 are in good agreement 
with the L120 simulation at m ~ 0.1; however, we do not 
plot these in the interest of clarity. 

Figure 0 clearly demonstrates the trend already indi¬ 
cated by the mark-correlation functions for the highest- 
c v ir halos to be much more strongly clustered than av¬ 
erage for M v i r < M* and less strongly clustered than the 
overall halo population for M v j r > M*. It is worth noting 
at this point that above M*, where the scaling of bias 


with mass is very strong, mass is still the dominant vari¬ 
able in determining bias. However, well below M*, the 
scaling of bias with mass flattens, and formation time 
appears to be the dominant variable determining bias. 

Below, we provide a fitting function for the bias as a 
function of both concentration and mass. Our simula¬ 
tion data are not sufficient to determine this function 
with high accuracy, but we present this function to give 
a convenient way to estimate the magnitude of the ef¬ 
fects of these trends in particular applications such as 
the clustering of specific galaxy populations. 

Let d = lll(Cvir )/cr(lnCvir) = log(c v ir )/cr(l0gC v ir), 
such that the probability distribution of d at fixed halo 
mass P(c')dc' is Gaussian with unit variance. We define 
the relative bias of halos as a function of c', as the ratio 
of the clustering amplitude of halos of fixed m and fixed 
c' relative to the clustering amplitude of all halos of fixed 


to , 


& c vi r( c 'l m ) 


£0", c'\m) 
£(r\m) 


(5) 


where here we have again taken the average of the halo 
bias over separations from 5 < r/ /i -1 Mpc < 10. We 
find that a good fit to the simulation data is given by 


5 Cvir (c'|m) = p(m) + q(m)c' + 1.61[1 - p(m)]c' 2 , (6) 


where 


p{m) = 0.95 + 0.042 ln(?A 0 - 33 ) 

0.22[to°- 33 + ln(m 0 ’ 33 )] 
<7(m) = 0.1- 


fl 


' 7 , 0.331 


The best fitting parameters for this relation satisfy the 
normalization condition 


& c vir (c / |?rt)P(c , )dc' = 1.0 


(7) 


to within a few percent. The simulation results are con¬ 
sistent with this normalization within the sizable errors, 
and we justify this constraint in more detail in §0] where 
we discuss the implications of this relative bias of halos 
on the halo model. In Figure^] we show the fit of Eq. © 
compared to the relative bias measured from the simu¬ 
lations as a function of the scaled concentration variable 
c', for several values of the scaled halo mass to. 


3.3. Halo Occupation Mark 

We expect that this clustering effect may extend to 
other properties of halos and the galaxies they host, 
especially those which are known to be strongly cor¬ 
related with formation time and halo structure. Halo 
angular momentum and halo shape are two such halo 
properties that are relevant to galaxy formation and 
known to correlate well with halo formatio n history (e.g., 
IVitvitska et al.ll20Q2t lAllgood et alJl2006lf 

The quantity from dissipationless simulations that is 
most pertinent to models of the statistics of galaxy clus¬ 
tering is P(fV sa t|M v ir), the probability distribution of 
the number of subhalos per host halo at fixed host halo 
mass. In such simulations this is the best proxy for the 
num ber dist rib ution of satellite galaxies per halo (e.g., 
iKravtsov et all 12004 ;I) . The probability distribution of 
this number of satellite galaxies per halo as a function of 
halo mass, P(iV sat |M v i r ) is a primary ingredient in halo 
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c' = ln[c vir /<c vlr (M vlr )>] 
ff ( ln c vir ) 


Fig. 4. — Relative bias as a function of normalized concentration, 
d = log(c v ir )/cr(log c v ir ), for various values of halo mass scaled 
by the typical collapsing mass rh = M v j r /M*. The points show the 
values of the relative bias, 6 Cvir , measured directly from the host 
halos in the LI20 and L80 simulations and the lines show the fit 
of Eq. 151 . We show the relative bias at five different scaled halo 
masses, rh = 33, rh = 13, rh = 0.36, rh = 0.06, and rh = 0.01. 


model calculations of g alaxy clustering (see § 0] below). 
iSheth fc TormeiJ ((2004) and lGao et all ((2005) have em¬ 
phasized that the formation time dependence of cluster¬ 
ing breaks a fundamental assumption of the halo model, 
namely that galaxies populate halos of a given mass in 
a manner that is statistically independent of halo envi¬ 
ronment. In fact, this is true only if P(V sat |M v i r ) is a 
function of halo formation time. Subhalos are natural 
sites for galaxy formation, so a more direct test is to 
sh ow that halos cluste r differently as a function of N sa , t ■ 
iZentner et al . (l2005ll showed that both a c and c v i r are 
strongly correlated with V sat in host halos of fixed mass. 
We update this correlation for the massive halos in the 
L120 and L80 simulations in Figure 0 where we com¬ 
pare the number of satellites with Mh os t > 10 3 M, u b 
in the massive host halos of the L120 and L80 simula¬ 
tions with the host halo concentrations and formation 
times. Scaling the satellite number with respect to the 
host mass normalizes out the gross dependence of satel¬ 
lite number on host halo mass. Moreover, we have nor¬ 
malized both c v i r and a c to their average values as a 
function of halo mass. Figure 0 clearly shows that early- 
forming, high-concentration halos have fewer satellites. 
The basic reason is that halos that accrete their sub¬ 
halos first have more time for those subhalos to be de¬ 


stroyed or to merge with the central ob j ect d ue to dynam¬ 
ical f ri ction (e.g-.lKravtsoy fc Klypinlll999tlTaffoni et al.l 


200]| Zentner & Bullock 

2003; Zentner et all SOOl 

van den Bosch et al.1 2005 

Tavlor fe Babul 2003). 


In light of this strong correlation, the cluster¬ 
ing dependence of formation time and halo concen¬ 
tration found in the previous section suggests that 
halo clustering is lik e ly to b e a function of N KPI ± a s 
well. iKravtsov et, all ll2004alh iTasitsiomi et, all ll2004fl . 



0.5 1.0 0.5 1.0 1.5 

a c /<a c > c vi /<c vir > 

Fig. 5.— Correlation between halo occupation number and for¬ 
mation scale factor ( left panel) and concentration ( right panel), 
counting subhalos 1000 times less massive than their hosts. Host 
halos more massive than 1 x 10 14 1 Mq are plotted from the 
L120 simulation ( red circles), and host halos more massive than 
5 x 10 13 /i -1 Mq are plotted from the L80 simulation ( green di¬ 
amonds). Each of the variables is normalized to the mean of the 
variable as a function of halo mass. 


and IConrov. Wechsler. fc Kravtsov (1200 (11 have demon¬ 
strated that halos and subhalos selected by their max¬ 
imum circular velocities provide excellent matches to 
the observed galaxy-galaxy autocorrelation function, 
galaxy-mass cross correlation function, as well as to the 
luminosity-dependence a nd redshift evolutio n of cluster¬ 
ing, respectively (see also lBerrier et a.Ol2006l for a similar 
result for close-pair sta tistic s). Following these studies 
and the arguments in § 12.11 for quantifying subhalo size 
as a function of maximum circular velocity, we study host 
halo clustering as a function of the number of satellites 
above a U max threshold as a quantity that is particularly 
relevant to galaxy clustering predictions. 

In Figure 0 we make a first attempt to quantify the 
strength of clustering for samples of halos marked by 
their occupation number. For two halo samples, we have 
selected subhalos with maximum circular velocities I4at, 
above a fixed fraction of the maximum circular velocities 
of their hosts Uhost- Taking this ratio of circular velocities 
scales out the dependence of V sat on host halo size. Fig¬ 
ure 0 shows a relatively small but statistically-significant 
tendency for halos in pairs separated by ~ 5 — 10 h -1 Mpc 
to have above-average numbers of satellites. This is the 
most direct demonstration yet that the halo occupation 
by galaxies is a function of environment. 

Note that in Figure 0 we are forced to study only large 
host halos in order to guarantee that their subhalos are 
well resolved. As such, we probe a different range of host 
halo masses than shown in Figure 0 The sense of the 
trend is what we would expect at this mass range, which 
is slightly bigger than M*, from the correlation between 
a c and V sat . Low-concentration, late-forming halos in 
this mass range are more clustered than average, and it 
is these halos that are expected to have more satellites. 
Based on the previous results for a c - and c v i r -dependent 
clustering, if one were able to measure the clustering of 
low-mass halos with several satellites, this trend may re¬ 
verse. 

4. PROPERTY-DEPENDENT HALO CLUSTERING AND 
THE HALO MODEL 

The results of the previous two sections have poten¬ 
tially important implications for the halo model of clus- 



















































WECHSLER et al 



10 1 


10 


r [h -1 Mpc] r [h -1 Mpc] 


1.6 
1.2 
0.8 
0.4 " 
0 

- 0.4 ; 
- 0.8 
- 1.2 
- 1.6 


Fig. 6.— The dependence of clustering on satellite number. 
The panels show MCFs A4jv sat ( r ) (solid lines), with the number 
of dark matter subhalos N sa t , as mark. The two panels correspond 
to different samples. In the left panel, results are shown for all 
host halos with maximum circular velocities Vhost > 300 kms —1 
and satellites with \4 a t > Ei 0 st/3. In the right panel, we show 
a sample with Vh os t > 400 kms -1 and V S at > Fh os t/4. In each 
panel, the shaded bands have the same meaning as in Figure [5] 


tering. The basic idea behind the halo model frame¬ 
work has a long history, initially in analytic mod¬ 
els that described galaxy clustering as a superposi¬ 
tion of randomly-distributed clusters with specifi ed pro¬ 
files and_a 1 ^ange_o£_cluster masses_J^fe£^^^^cot3 
119521 iMcClelland fe SilklfT977t iPeeblesI I1974J). The e~ 
plosion of recent activity in this field has been partly 
fueled by the recognition that a combination of this 
approach with recently developed tools for predict¬ 
ing the spatial clustering of dark matter halos (e.g., 
MofcWhjtd Il996t |Sheth_^T onQe3 [19991; Sheth e t al.1 


2001bt iSeliak fc Warrenl 12004 : ITinker et al, 2005 1 pro¬ 
vides a powerful formalism for analytic calculations of 
dark matter clustering, which can be extended naturally 
to biased galaxy populations. 

In modern implementations of the halo model (e.g., 
IScherrer fc Bertschingerl ITooTl ISeliakl 120001 IMa fe Frv 

120001 iPeacock fc Smith! l2000t iScoccimarro et all 1200111 


two-point galaxy clustering is calculated by specifying 
the clustering of dark matter, the non-linear clustering 
of dark matter halos, the first two moments of the HOD, 
and the spatial distribution of galaxies within their host 
halos. The standard implementation also assumes that 
halo clustering is independent of all halo properties aside 
from halo mass. In particular, in calculations of either 
galaxy or dark matter correlation functions, it is assumed 
that the HOD and halo concentrations depend only on 
halo mass and that there is no spatial segregation of ha¬ 
los based on their occupation numbers, concentrations, 
or any other properties that may be relevant to the prop¬ 
erties of the galaxies that the halos host. We refer to 
a halo model based on this set of assumptions as the 
“strong” halo model. As we have shown, these assump¬ 
tions are not generically valid: the two-point clustering 
of dark matter halos depends on halo concentration, halo 
formation time, and halo occupation. Below, we review 
several salient aspects of the halo model and explore the 
implications of relaxing these assumptions about the lack 
of environmental dependence of halo properties. As an 
illustrative example that is closely tied to the results of 
the previous section, we focus most of our attention on 
relaxing the assumption that clustering is independent 
of the halo profile concentration. It is worth keeping in 


mind that there are several different implementations of 
various aspects of the halo model: choices must be made 
about how to model the translinear regime for galax¬ 
ies (especially the treat ment of halo excl usion and scale- 
dependent bias, see e.g. ITinker et al.l200(j Appendix B), 
what analytic models to use for the mass fun ction and 
bias, and how to mo del the halo occupation llZhengl2fl04 
iConrov et alJ 120061 e.g.) and the profiles of galaxies in 
halos. 


4.1. The Dark Matter Correlation Function in the 
Standard Halo Model 

The simplest application of the halo model is to calcu¬ 
late the dark matter two-point correlation function. The 
halo model breaks the computation into a “one-halo” 
term receiving contributions from the mass density in 
individual halos on small scales and a “two-halo” term 
with contributions from mass in distinct pairs of halos on 
large scales. In the standard halo model, we can write 
the lar ge-scale hal o correlat ion function of dark matter 
as (IScherrer fc Bertschingeil 19911) : 

U m (r)=e h (r)+e h (r) + l. ( 8 ) 


The one-halo term is 

£ lh (r) = ~T [ dm m 2 f d 3 xX m (x)X m (x + r), 
Pm J dm J 

_ (9) 

with dn(m)/dm the mass function of halos and X m {x) the 
density distribution within a halo of mass to normalized 
so that the integral of the profile over the volume of the 
halo is unity. The two-halo term is 




' f dn(mi) d?z(?n 2 ) 

dmi / dm2 TOi—^- to 2 - 


dmi 

x jd 3 x J d 3 y X mi {x)Xm 2 {y) 
x 6ih0?-y + f|rni,m 2 ), 


dm 2 


( 10 ) 


where £hh(:?|?7ii, to 2 ) is the cross-correlation function of 
halos of mass m\ and to 2 and r = |r|. 

In the limit of separations much larger than the sizes of 
the largest halos, the correlation function is determined 
by the two-halo term alone. On such large scales, the cor¬ 
relation functions vary little over the length scales of ha¬ 
los so that £hh(*£ — V + F|mi, to 2 ) ~ Chh( c|toi, to 2 ), which 
allows the last two integrals in Eq. m to be replaced by 
£hh(F|?T'ii, to 2 ). Relating the halo correlation functions to 
the dark matter correlation function through the stan¬ 
dard assumption fhh(r\mi,m 2 ) ~ b h (mi)bh(m 2 )fdm(r) 
and requiring £ 2h (r) = on large scales forces the 

halo bias bh(m) to obey the constraint 


/ 


dm 


dn(m ) 
dm 



bh(m) = 1. 


( 11 ) 


This is the well-known normalization rule for the mass- 
dependent halo bias. 


4.2. The Galaxy Correlation Function in the Halo 
Model 

This model can be used to compute the statistics of any 
population for which all members reside in dark matter 
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halos. The most popular application is to compute the 
correlation statistics of galaxies. The equations of § 14.11 
can be adapted to this application simply by making the 
following substitutions. First, take —> n g / 2, where 

rig is the mean number density of galaxies. Take m —> 
(iVgai )m so that the mean number of galaxies per halo of 
mass to is counted rather than the mass per halo. Take 
to 2 —> (AIg a i (A^gai — 1)) m /2 so that pairs of galaxies within 
halos of mass m are counted. Finally, take X m (x) —> 
(x) to represent the the mean distribution of galaxies 
within host halos of mass m rather than the distribution 
of mass within halos. 

Following the logic of the previous section, we can com¬ 
pute the large-scale clustering of galaxies from the two- 
halo term alone. This leads to the well-known and useful 
relation for the large-scale bias of a galaxy population 
given the first moment of its HOD, (N ga i) m , 

frgai - =- [ dm d/ ^ y?t -(JV gal ) m 6 h (m). (12) 

n g J dm 

As one might expect, the large-scale bias of galaxies is 
given by a simple, weighted average of the bias of the 
halos in which they reside. 


4.3. The Dark Matter Correlation Function with 
c v ir -Dependent Halo Clustering 

Now consider recasting the halo model allowing halo 
clustering to be a function of both halo mass to, and 
the additional property of concentration c. If we define 
the probability of a value of c at fixed mass as P(c|m) 
then the number of halos of mass m with concentration c 
is dn(m, c)/dm.dc = P(c|TO)dn(m)/d?77,. This additional 
property complicates the halo model and requires an ex¬ 
tra integral over the distribution of halo concentrations. 
Specifically, the one- and two-halo terms become 

£ lh (r) = -r f dm f dcm 2 P{c\m)^j^- 
Pm J J 

x d 3 xX m (x\c)X m (x + r\c) (13) 


and 


£ 2h (r) = — J dmi J dc± J dm 2 J dc 2 


Pm 

dn(TOi) dn(m 2 ) 

x mi —j-P(ci mi) to 2 —r-P(c 2 |m 2 ) 

dmi dTO 2 

x J d 3 x J d 3 yX mi (x\c 1 ) ^rri2 ( 2 /(^ 2 ) 
x £hh(A — y + f|TOi, ci, m 2 , c 2 ), (14) 


where X m (x\c) is the density distribution for a halo of 
mass to and concentration c, and where the cross cor¬ 
relation of halos of mass mi and concentration ci with 
halos of mass m 2 and concentration c 2 is embodied in 
£hh(a?|m 1 ,c 1 ,m 2 ,c 2 ). 

Two consequences of these relations are evident. The 
first is simply that the one-halo term at small separations 
should be a weighted average of profile convolutions with 
the weighting given by P(c|to) dn(m)/dTO. This is true 
independent of concentration-dependent clustering has 
bee n overloo ked in most modeling efforts (although, see 
e.g. ISheth~ et~al J12001a ). The effect of adding scatter is 


only at the few percent level, but this will be important 
for future precision measurements of dark energy using 
lensing as the statistical uncertainty of the experiments 
approaches this level. Second, at intermediate scales (r ~ 
several Mpc), neglecting concentration-dependent clus¬ 
tering will lead to differences in the calculation of the 
dark matter correlation function due to a combination of 
the preferential clustering and the convolution factors. 

In § 14.11 we derived the normalization relation for 
the standard mass-dependent halo bias and an anal¬ 
ogous relation holds for the concentration-dependent 
relative halo bias 6 Cvir that we use in this pa¬ 
per. We can write the halo-halo cross correla¬ 
tion factor in terms of a concentration-dependent 
relative bias of halos with respect to all halos at 
fixed mass defined so that £hh(r\mi, Ci, m 2 , c 2 ) = 
b 2 vir (ci, c 2 , |TOi,m 2 )^hh(r|mi,m 2 ). Counting pairs of all 
halos of fixed mass must give the same result regardless 
of whether or not we subdivide the halo population by 
concentration at fixed mass. This requirement gives the 
general normalization condition 

J dci J dc 2 P(ci|TOi)P(c 2 |TO 2 )6 2 vir (ci,c 2 |TOi,TO 2 ) = 1. 

(15) 

Assuming that we can write the bias term as 
fre vi r( Cl ’ C2 l mi ’ m2 ) = & c vir (ci|mi)& Cvir (c 2 |?n 2 ) as in mod¬ 
els of deterministic bias, the normalization condition is 
then 

J dcP(c\m)b Cvit (c\m) = 1. (16) 

This normalization is consistent with our measurements 
of the relative concentration-dependent bias, and holds 
to within a few percent for the fitting formulae of § 13.21 

4.4. Galaxy Clustering with Concentration-Dependent 

Bias 

As with the standard halo model, a halo model incor¬ 
porating concentration-dependent halo clustering can be 
formally extended to galaxy clustering in a simple man¬ 
ner, though this extension may become cumbersome in 
practice. As a first attempt, we may assume that the 
number of galaxies per halo is independent of halo con¬ 
centration so that as in the previous discussion the sub¬ 
stitutions to -> (AT ga i) m and to 2 -> (AT ga i(Ar ga i - l)) m /2 
can be taken in Eq. Q and Eq. ci. The consequences 
of the concentration-dependent clustering are then quite 
similar to the case of the dark matter correlation func¬ 
tion, but are less direct because of inherent uncertainties 
in the link between the matter distribution within halos, 
A m (x|c), and the distribution of galaxies within h alos, 
A®j(xjc) iNagai fc Kravtsovl[2005t IChen et aH[2005lh 

Another possibility is that the galaxy HOD does de¬ 
pend on host halo concentration. Indeed, in Figure d 
we confirm that the number of dark matter subhalos per 
host halo at a fixed host halo mass does correlate with 
the halo concentration, and we have shown that host halo 
clustering is a function of the number of satellite halos 
that they contain. A more general and realistic assump¬ 
tion then seems to be to consider galaxy number as a 
function of both c and to so that the appropriate substi¬ 
tutions into the dark matter correlation function equa¬ 
tions [Eq. (THfl) and Eq. d] are to —> {N ga i(m,c)) miC 
and to 2 —> (N ga [[Ng a i — 1](to, c)) m>c . In this case, the 
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large-scale bias of a particular sample of galaxies can 
then be written as 

1 f d n(m) 

b g ai~— / dm— j - b h (m) 

rig J dm 

x J dc(fV ga i(m,c))6 Cvir (c|m)P(c|m). (17) 

The bias is weighted over both the concentration distri¬ 
bution and the mass distribution. As such, the model 
can be compared to the standard halo model using an 
effective halo occupation, the mean of which is given by 

Ngai(m) = J dcP(c\m)b Cvlr (c\m)(N gai (rn,c)} . (18) 

Note that because early-forming halos have a lower av¬ 
erage satellite number, for luminosity-selected samples it 
is possible that this could offset the higher bias and re¬ 
sult in galaxy clustering that does not depend strongly 
on c V in but this is unlikely to be the case for samples 
that are more directly connected to formation time. 

These arguments can be used and extended to account 
for any additional dependence of halo clustering on halo 
properties and corresponding halo occupation distribu¬ 
tion, most naturally formation time. Although a larger 
halo sample will be needed to fully characterize these 
trends, this first indication of the mass and redshift scal¬ 
ing of the trends of bias with halo properties that we give 
here should prove useful in order to estimate the size of 
these effects. 

4.5. General Implications 

Although these results urge caution in using the stan¬ 
dard halo model assumptions to calculate galaxy clus¬ 
tering statistics and infer cosmological parameters with 
high precision, it is not clear that they will have a large 
effect for galaxy samples that are selected by mass or, as 
in observational samples, by lumi nosity . One indi cation 
was given by the following test. iZentner et all ( 2005 1 
used the standard halo model combined with halo occu¬ 
pation derived from an analytic model for the evolution 
of halo substructure to predict the two-point correlation 
function for galaxies, and obtained a result that was con¬ 
sistent with the results of simulations that include all of 
the effects mentioned here. Still, it may be that small dis¬ 
crepancies would manifest if a lar ger sample wer e used 
to make this comparison. Second. IConrov et a D d2nn^h 
have compared estimates of the HOD from galaxy clus¬ 
tering mea sur ed i n the Sloan Digital Sky Survey (SDSS) 
llTinker et al.l2f)05l) . where these effects are ignored, with 
the HOD of subhalos measured in these same simula¬ 
tions, where again these effects are included implicitly, 
and found that they are in better than ~ 10% agreement 
for galaxies brighter than M r = —19. 

Other authors have investigated whether the halo oc¬ 
cupation is dependent on envi ronment witho ut finding 
any such trends. For example. iBerlind et al.l ( 2003 ) in¬ 
vestigated mean halo occupation as a function of local 
density in hydrodynamic simulation in a computational 
box 50 ft -1 Mpc on a side. These authors measured local 
density in 4/i -1 Mpc spheres around host halos a nd found 
no indication of such a trend. |YoQ_al_aD (120051) used th e 
same hydrodynamic simulation as lRerlind et all 120-031) , 
to determine whether environment-dependent halo oc¬ 
cupation effects could be detected in galaxy correlation 


statistics. By swapping the galaxy po pulations b et ween 
halo populations with similar masses, lYoo et ah i'2()() 5' 
found 5 — 10% effects on the galaxy-galaxy and galaxy- 
mass correlations, which were within their statistical un¬ 
certainties. 

These tests all indicate that the effects for mass- or 
luminosity-selected samples are at the 10% or lower level. 
However, we caution that these studies all employed 
simulations of relatively small volumes. We have per¬ 
formed a comparable test to that shown in Figure 0 us¬ 
ing our smaller 60 /i _1 Mpc box (L60 studied by B01) and 
found no statistically-significant effect due to the eight¬ 
fold smaller computational volume. As such, it is not 
surprising that these earlier studies were unable to find 
a conclusive result with their simulation of a yet smaller 
volume. This indicates that large, high-resolution sim¬ 
ulations (suitable for detecting subhalos in M v i r < M* 
hosts) will be necessary to determine whether the rel¬ 
ative bias measured by halo occupation has the same 
behavior as might be expected from using concentration 
in addition to the global relationship between concentra¬ 
tion and satellite number as a proxy for halo occupation 
for all halos in the volume. 

These trends have not been readily apparent in ob¬ 
servations to date, although a detailed comparison is 
complicated by the difficulties in selecting a sample 
that closely corresponds to a mass- and concentration 
or formation history-selected sample. Recent stud¬ 
ies looking at galaxy clustering in the SDSS have not 
found any tr end of clustering with galaxy properties. 
iSkibba et alJ ( 2006 ) used luminosity-marked correlation 
functions to test whether the luminosity-dependent clus¬ 
tering is consistent with being a simple consequence 
of mass-dependent clust ering, and found that it is, us¬ 
ing ~ 0.5L* galaxies. lAbbas fc Shethl (120061) showed 
that the observed environmental dependence of cluster¬ 
ing in SDSS could be fully accounted for by correlations 
between galaxy properties and host mass and by host 
mass and environment. Another study by Blanton and 
Berlind (in preparation) shuffled 0.3L* galaxies be¬ 
tween groups of the same luminosity and found that the 
red and blue correlation functions remain unchanged at 
the ~ 5% level. 

These results offer some confirmation of the theoret¬ 
ical indications that the effects aren’t strong in this 
regime, but are not terribly surprising in light of our 
results, because for halos with concentrations more than 
la from the mean, the effect is less than 20% in the 
range 0.5 M*. < M v i r < fOM*. Still, the last study 
did look at halos well below M*, and still did not see 
a strong trend; this may indicate that the correlations 
between formation time and galaxy observables are not 
extrem ely stron g. An additio nal stu dy was performed by 
lYang. Mo. fc van den Boschl ( 2006 ). who measured the 
clustering of a sample of groups selected from the 2dF. 
They investigated the relative clustering of groups of a 
given mass as a function of their spectral type, which 
one might expect to correlate with formation epoch for 
galaxies at fixed mass, and found that clusters with early- 
type central galaxies were more clustered than clusters of 
the same luminosity that had late-type central galaxies. 
This goes in the sense of our predicted trend for low mass 
halos but they found that it extended to cluster masses. 
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A first attempt to connect these trends directly with 
observable quantities was made bv iCroton et all ( 2006 ) 
after this paper was submitted. This paper uses a semi- 
analytic galaxy formation model to investigate the affect 
of assembly bias on the total galaxy sample and on sam¬ 
ples selected by color, and finds that effects ranging from 
a few percent to almost a factor of two depending on the 
selection. At first glance, the predictions of this model 
do seem to be in mild conflict with the lack of trends 
seen in the studies above, but this is still unclear. First, 
shuffling galaxies on observational proxies for mass may 
lead to much weaker effects than shuffling on the the¬ 
oretical halo mass. Secondly, the detailed predictions 
for observed galaxies may dependent sensitively on the 
galaxy formation model, and it may be that the lack 
of observed trends indicates that color is less correlated 
with formation time than in this particular model. 

The trends we have presented here are likely to be more 
important for studies of the clustering of extreme objects 
that may be thought to form particularly early or late or 
have particular formation histories. Examples of such 
populations would be color or star formation selected 
samples, high-mass clusters selected by occupation num¬ 
ber, low surface-brightness galaxies, or dwarf galaxies. 
We discuss the consequences for such samples further in 
the next section. 

5. SUMMARY AND DISCUSSION 

We have investigated the clustering of dark matter 
halos as a function of several internal halo properties, 
namely formation time, concentration, and occupation 
number. We have confirmed that halo clustering is a 
function of halo formation time and have shown that the 
effect is scale-dependent. We have also demonstrated 
that halo clustering is a strong function of halo concen¬ 
tration, and that the strength and sign of this trend is 
a function of mass. Of relevance to studies of galaxy 
clustering statistics, we also find the clearest indication 
yet that host halos of fixed mass cluster in a way that 
is dependent upon the number of subhalos that reside in 
them. Our primary results can be summarized as follows. 

1. Halo clustering is a strong function of formation 
time for fixed mass halos. This effect strengthens 
with decreasing halo mass and with decreasing sep¬ 
arations, and is an increasingly strong function of 
mass as halos become less massive than M*. These 
results are in broad agreement with the recent re¬ 
sult s oflSheth fc Tormenl (120041 . iGao et alJ (1200511 
and lHa.rker et all (1200011 . 

2. We have presented the first definitive measurement 
showing that the clustering of dark matter halos is 
a function of halo concentration. This effect is a 
strong function of halo mass, and can be character¬ 
ized over a range of mass and redshift as a function 
of halo mass scaled by the typical collapsing mass, 
Mvir/M*. Below M v i r /M*, halos of high concen¬ 
tration are more clustered than halos of low con¬ 
centration and this trend strengthens with decreas¬ 
ing halo mass. For halos more massive than M*, 
the trend changes sign, and halos of low concen¬ 
tration become more strongly clustered than their 
high concentration counterparts. 


3. The dependence of halo bias on concentration, 
mass, and redshift can be parameterized in a simple 
way: b(M,c\z) = b(M\z)b Cvir (c\M/M*). We pro¬ 
vide a fitting function for b Cvlr (c|M/M*) that can 
be used to estimate the importance of these effects 
in various regimes. In §01 we demonstrate how this 
relative bias can be incorporated into a halo model 
formalism and discuss the effects of concentration- 
and formation time-dependent bias on estimates of 
matter and galaxy correlation statistics. 

4. We c onfirm a n d up d ate earlier r esults 
fe.g.. IGao et all [2004 IZentner et all 1200 j 
Ivan den Bosch et ali l2005t iTavlor fc Babu l 1200 51 
that the occupation number of satellite halos is 
strongly correlated with both concentration and 
formation time. We present the first detection of 
a trend between clustering and halo occupation, 
showing that at high mass, high-occupation 
(late-forming) halos are more clustered than their 
low-IV counterparts. 

|Shetfi&Tomie3 120041 . IGao et all 1200.511 and 
lHa.rker aTT if2f)06ll have emphasized that the trend 
with formation time indicates that using the halo model 
to estimate clustering can be problematic. We have 
quantified this explicitly, by investigating how these 
trends extend to two variables that are directly relevant 
to such calculations, namely concentration and halo 
occupation. There is a weak indication that the trends 
we see with concentration and halo occupation are 
slightly stronger than would be predicted simply by 
their dependence on formation time and the trends with 
formation time itself, however, this may be do to the 
larger measurement error in formation time. Larger 
simulations will be required to determine to higher 
accuracy whether the strength and nature of the trends 
with concentration and occupation have features that 
are not represented by the global correlations between 
these variables and formation time. 

As we emphasized in § 0] the dependence of clustering 
on concentration implies corrections to halo model calcu¬ 
lations of the dark matter power spectrum, as well as cor¬ 
rections to halo model calculations of galaxy correlations. 
These results prescribe caution when using the stan¬ 
dard halo model assumptions to calculate galaxy clus¬ 
tering statistics and infer cosmological parameters pre¬ 
cisely, but it is not clear that they will have a large effect 
for samples that are selected by mass or, as in observa¬ 
tional samples, by luminosity. Thi s is especially t rue for 
galaxies around L*. For example. lYoo et, a! ] 12001 and 
IZentner et all (|2f)05H have both shown that shuffling the 
host halos of such a sample results in less than about 5% 
effects in clustering statistics, albeit in relatively small 
volumes. We expect that the trends we have shown here 
will have stronger effects on high- or low-mass samples 
(when compared to M*) that are selected on some prop¬ 
erty that is directly connected to either formation time, 
concentration, or the number of satellite galaxies, and we 
speculate on a few of these below. 

The fact that the clustering of low-mass halos is 
strongly correlated with formation time may have inter¬ 
esting implications for the so-called “void phenomenon”, 
the tendency of low-mass galaxies to avoid the voids de¬ 
fined by larger galaxies. The currently-favored ACDM 
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model predicts substantial mass in voids, so the absence 
of galaxies in these regions has been suggested to be a 
potentially ser ious problem for the prevailing paradigm 
llPeebles(l200lD . Whether this is inde ed a problem for 
these models isn’t clear; for example, Mathis & White 
( 2002 1 have investigated the clustering of low luminosity 
galaxies in LCDM simulations combined with a galaxy 
formation model and found that all galaxi es avoid ed the 
voids defined by the brighter galaxies. iBenson et alJ 
12001 found a similar result but emphasized that more 
data was needed t o full y test the issue. Recently, 
iFurlanetto Kr. Pira.nt l|200(fl compared predictions for void 
sizes based on the excursion-set formalism with obser¬ 
vations from SDSS, and saw indications that the ob¬ 
served voids are somewhat bigger than the model pre¬ 
dicts. A related piece of evidence for differential cluster- 
in g of l ow-lumino sity galaxies has been emphasized by 
I Tull v et all iflOOl . namely, the difference between the 
luminosity function in clusters, which rises steeply to¬ 
ward low-luminosity dwarf galaxies, and the luminosity 
function in voids, which is substantially shallower and 
appears to be entirely bereft of a dwarf galaxy popula¬ 
tion. 

It h as been suggested dBullock. Kravtsov, & Weinberd 
l2rionlf that the discrepancy between the abundance of 
dwarf satellites observed in the Local Group and the 
abundance of relevant dwarf mass dark halos (u max < 
50 kms” 1 ) expected by CDM can be resolved by sup¬ 
pressing galaxy formation in halos that form after the 
universe is re-ionized. This would bias luminous dwarf 
galaxies to be associated with late-forming, small dark 
matter halos. As we have shown, the clustering trend 
with formation time is quite strong in low-mass halos 
of this type. If the photoionization significantly affects 
galaxy formation in dwarf-size galaxy halos, the dwarf 
galaxies they host would be substantially more clustered 
than typical halos in the same mass range, and this 
trend can b e enhanced b y add itional env ironmental fac¬ 
tors llKravtsov. Gnedin, fc Klvpiiil2004b l. If we extrap¬ 
olate the results of Eq. @ to M ~ 10 -3 Af*, this implies 
that the correlation length for the earliest forming quar- 
tile should be about a factor of 4 higher than that of the 
typical halos of that mass. This may provide a natu¬ 
ral explanation for the lack of observed dwarf galaxies in 
voids. 

More generally, any astrophysical effect that biases 
small galaxies to lie in early-forming halos would pro¬ 
duce the same effect. There is observational evidence 
that star formation timescales are quite long (~ lOGyr) 
in small galaxies and relatively short (~ lGyr) in 
large galaxies (e.g..lSearle et al.ll973H.Iuneau et al!2005t 

IWillmer et alJl2005l Weiner et al.. in preparation). One 

implication of this is that the total stellar mass accumu¬ 
lated in low-mass halos will be much more sensitive to 
halo formation time compared to high-mass halos. Such 
a formation-time bias in the observed properties of low- 
mass galaxies may not only help to explain the void 
phenomenon, but will likely be important in attempts 
to construct conditional luminosity functions that ex¬ 
tend to low-luminosity galaxies. The strong clustering 
of early-forming low-mass halos may also play a part in 
the observed trend that dim red galaxies are substan¬ 
tially more cluste red than their intermediate luminosity 
counterparts le.g. lNorberg et al!2f)02HHogg et alT2 0031. 


although the basic effect can be exp lained if the the ma - 
jority of these galaxies are satellites dBerlind et al.l2005l) . 
Of course, any model which forms galaxies using forma¬ 
tion histories from a N-body simulation will include the 
halo clustering effects implicitly, but the extent to which 
this effects galaxy clustering will depend on the efficiency 
of galaxy formation in low-mass halos. 

There is another interesting implication of our re¬ 
sults. Numerous observational signatures indicate 
that the central densities and concentrations of low 
surface-brightness (LSB) galaxies are significantly 


(e.g. 

van den Bosch et al. 2000: IDebattista & Sel 

wood 

2000 

[Keeton 200ll van den Bosch & Swateto 

2001; 

Alam et al. 2002 

; Zentner & Bullock 2002, 

2003; 

McGaugh et al. 

2003: van den Bosch et al. 2003a: 

Kuhlen et al. 2005 

Simon et al. 2005!). The dependence 


of clustering on halo concentration may also have 
implications for the interpretation of the concentra¬ 


tions of LSBs. LSBs have been shown _to be n otably 
less clustered than typical galaxies (IktoetoiL 11994 
iRosenbaum fc Bomansll2004l but see also lPeebleal200lib 
This might imply that they reside in a biased population 
of late-forming, low-concentration halos. If the typical 
host halos for these galaxies are around ~ 0.01AL*, and 
LSB galaxies are about 60% less biased than typical 
galaxies, this would imply that they have concentration 
values that are about ler below the mean. This would 
reduce the tension between the predicted and observed 
concentrations of the halos hosting these galaxies. 

Finally, our results also have implications for es¬ 
timates of the cluster mass function using optically- 
selected cluster samples. For a richness-selected sam¬ 
ple of clusters, or any sample where the primary ob¬ 
servable that clusters are selected on is correlated with 
formation time or halo concentration, mass estimates 
from “self-calibration”, using the clustering of clusters 
(iMaiumdar fc Mobil 1201)4 iLima fc HiJ [20051. may bias 
results towards higher masses. Note that because these 
effects correlate with concentration, and probably halo 
shape and merger history, they could affect SZ- or X-ray- 
selected samples as well. Similar biases could potentially 
manifest in weak lensing measurements, and could lead 
to over-estimates of the mass-to-light ratios in methods 
that use clustering to constrain t he H OD or t he con¬ 
dit iona l luminosity function (e.g. Ivan den Bosch et alJ 
i2oo.m They could also impact clustering-based mass 
estimation for other populations that live in high M/M* 
halos, e.g., high-redshift bright galaxies or quasars. 

Further work is needed to make quantitative estimates 
of these effects at high masses, but this trend may be 
measurable in current optical cluster samples (e.g., from 
the SDSS, Koester et al., in preparation). The trend be¬ 
tween halo occupation and formation time implies that 
clusters with a given number of galaxies will be a mix of 
high-concentration, early-forming, high-mass halos and 
low-concentration, late-forming, low-mass halos. If one 
can find an observable measure that correlates with for¬ 
mation time (for example, the difference between the lu¬ 
minosity of the first and second brightest cluster galax¬ 
ies, or the star formation histories of the satellite galax¬ 
ies), these correlations make specific predictions. At 
fixed richness or N ga \, the early-forming sample should be 
more massive and more concentrated (because concentra- 
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tion varies much more slowl y with halo mass than with 
formation time: c ~ m 01 . iBullock et ablfedOl b but less 
clustered than expected for typical halos of that mass. 

Our results indicate that the Universe is somewhat 
more complicated than our simplest models. However, 
the complication should be viewed as an opportunity 
rather than an obstacle, as we can potentially learn 
a great deal about details of galaxy formation in ha¬ 
los and their evolutionary histories from the trends dis¬ 
cussed in this work. The cu rrent an d upcoming large 
galax y survey s (e.g., SPSS. JAdelman-McCarthv et, al.l 
UmI DEEP2. ICoilet alJl2004t DE S. IAbbott et al.H2005T 
LSST, and SNAP. IAldering et alJl200ll should be able 
to accurately evaluate such effects and test the predicted 
trends. 
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